# RLHF Fine-tuning
Dmind 1
MIT
DMind-1 is a Web3 expert model built upon Qwen3-32B, optimized for the Web3 ecosystem through supervised instruction fine-tuning and human feedback reinforcement learning, achieving significant improvements in task accuracy, content safety, and expert-level interaction alignment.
Large Language Model
Transformers Supports Multiple Languages

D
DMindAI
129
21
Llama 3.2 1B GGUF
Llama 3.2 is a collection of 1B and 3B parameter-scale multilingual generative models released by Meta, optimized for dialogue scenarios and supporting various language tasks.
Large Language Model Supports Multiple Languages
L
Mungert
643
3
Llama 3.1 Nemotron 70B Instruct HF
A custom large language model by NVIDIA, designed to enhance the usefulness of responses generated by LLMs to user queries.
Large Language Model
Transformers English

L
nvidia
29.98k
2,033
Llama 3 8B Japanese Instruct
This is a Meta-Llama-3-8B-Instruct model fine-tuned on Japanese dialogue datasets, specializing in Japanese conversational tasks.
Large Language Model
Transformers Supports Multiple Languages

L
haqishen
33
22
Eleuther Pythia6.9b Hh Sft
Apache-2.0
A causal language model based on the Pythia-6.9b foundation model, fine-tuned using Anthropic's hh-rlhf dataset for supervised training
Large Language Model
Transformers English

E
lomahony
58
1
Llama 2 7b Hf
Llama 2 is a 7-billion-parameter pre-trained generative text model developed by Meta, part of the open-source large language model series
Large Language Model
Transformers English

L
meta-llama
914.57k
2,038
Bloom 560m RLHF SD2 Prompter
Openrail
An RLHF fine-tuned Stable Diffusion 2.0 prompt generation model that can automatically expand or generate high-quality image descriptions
Text Generation
Transformers

B
crumb
31
12
Featured Recommended AI Models